Autoregressive hidden Markov model for applied tasks of vocal fold pathology detection
نویسندگان
چکیده
The recognition and training algorithms for autoregressive hidden Markov models were developed in order to solve the task of vocal fold pathology detection. Three databases were created and used for 3 vocal pathologies detection. During the experiments the proposed vocal tract pathology detection system based on autoregressive hidden Markov models and wide-range AFT mel-spectrum provides very high detection accuracy. INTRODUCTION The analysis of speaker individuality is widely used for different tasks, for example, speaker identification and verification, diagnostics of speech producing organs, secure access [1 ], [2]. It is well known that if there is the presence of vocal fold pathology, significant changes can occur that alter the speech production system, resulting in deterioration of voice quality. Analysing such changes we can make a conclusion about the state of person's vocal fold and compare it with the template health person parameters. Subjective testing made by a physician is not able to detect pathology on the earlier stages, except this such testing strictly depends on the physician experience. There is a possibility to perform such analysis objectively based on more or less nonlinear model. From this point of view systems based on hidden Markov models (HMM) are very attractive. Flexible and powerful mathematical apparatus of these models lets to use them for effective temporal information modelling. Speech signal by its nature has two aspects. Firstly, speech of the person is defined by physical parameters, such as vocal tract length, glottal size and so on. Secondly, the speech producing is impossible without neural control of the articulators, which defines the personal learned abilities such as dialect or regional accents, pronunciation, speed and timing of the articulators. For better analysis of speaker pronunciation peculiarities it is necessary to take into consideration interrelations between close frames of the same phonetic unit and loose important information about acoustical structure of the phoneme. In connection with this we introduce an autoregressive hidden Markov model for the task of vocal tract pathology detection, which is similar to the task speaker identification within the framework of such approach [3]. The character vector used for speech analysis greatly influences the voice analysis performance. In order to provide the high accuracy we have to use apriory knowledge about human speech producing and perception. In particular, the knowledge of main psycho-acoustical principles gives us the possibility to cut information not grasping by human ear. Information about high frequencies in the speech signal is very important also, and it is rather reasonable to use it for vocal fold pathology detection. SPEECH PARAMETERS FOR VOCAL FOLD PATHOLOGY DETECTION Now human ear is the best tool for speech analysis and it is not a bad idea to model it in order to solve different speech analysis tasks. The usage of psycho-acoustical principles lets to form speech parameters that reflect all essential speech features. Psycho-acoustical parameters include absolute threshold of hearing, critical bands, simultaneous and temporal masking [4]. Here we briefly review the psycho-acoustical parameters in order to include them to the speech feature vector estimation process. The absolute threshold of hearing is characterized by the amount of energy needed in a pure tone such that it can be detected by a listener in a noiseless environment. The threshold is well approximated by the nonlinear function ~(!}=3.6{!/lOO)r ~~f +10"(!/100~ (1) MAVEBA 1999, Firenze, Italy 108 Models and Analysis of Vocal Emissions for Biomedical Applications
منابع مشابه
A HMM-Based Method for Vocal Fold Pathology Diagnosis
Acoustic analysis is a proper method in vocal fold pathology diagnosis so that it can complement and in some cases replace the other invasive, based on direct vocal fold observation’s methods. There are different approaches for vocal fold pathology diagnosis. This paper presents a method based on hidden markov model which classifies speeches into two classes: the normal and the pathological. Tw...
متن کاملA HTK-based Method for Detecting Vocal Fold Pathology
INTRODUCTION In recent years a number of methods based on acoustic analysis were developed for vocal fold pathology detection. These methods can be categorized in two categories:a) detection based on the phonemes b) detection based on the continuous speeches. While there are many researches which belong to the first category, there are few efforts for detecting vocal fold pathology based on the...
متن کاملIntrusion Detection Using Evolutionary Hidden Markov Model
Intrusion detection systems are responsible for diagnosing and detecting any unauthorized use of the system, exploitation or destruction, which is able to prevent cyber-attacks using the network package analysis. one of the major challenges in the use of these tools is lack of educational patterns of attacks on the part of the engine analysis; engine failure that caused the complete training, ...
متن کاملA novel hybrid method for vocal fold pathology diagnosis based on russian language
In this paper, first, an initial feature vector for vocal fold pathology diagnosis is proposed. Then, for optimizing the initial feature vector, a genetic algorithm is proposed. Some experiments are carried out for evaluating and comparing the classification accuracies which are obtained by the use of the different classifiers (ensemble of decision tree, discriminant analysis and K-nearest neig...
متن کاملEvaluation of the Hidden Markov Model for Detection of P300 in EEG Signals
Introduction: Evoked potentials arisen by stimulating the brain can be utilized as a communication tool between humans and machines. Most brain-computer interface (BCI) systems use the P300 component, which is an evoked potential. In this paper, we evaluate the use of the hidden Markov model (HMM) for detection of P300. Materials and Methods: The wavelet transforms, wavelet-enhanced indepen...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999